DOC: update intro and add link to preprint #80

GavinHuttley · 2024-11-11T20:46:21Z

Summary by Sourcery

Update the README to enhance the introduction of diverse_seq and include a link to a relevant preprint.

Documentation:

Updated the README to provide a more detailed introduction to the diverse_seq tool, highlighting its alignment-free algorithms and their application in phylogenetic workflows.
Added a link to a preprint for further reading on the methods implemented in diverse_seq.

sourcery-ai · 2024-11-11T20:46:24Z

Reviewer's Guide by Sourcery

This PR updates the README.md file to provide a more comprehensive and accurate description of the diverse_seq project, including its capabilities and performance characteristics. The changes replace the original brief introduction with a more detailed explanation and add a link to the project's preprint.

No diagrams generated as the changes look simple and do not need a visual representation.

File-Level Changes

Change	Details	Files
Updated project description to better reflect its capabilities and scope	Modified project tagline to emphasize alignment-free algorithms and phylogenetic workflows Added detailed performance metrics for genome analysis Included information about seed phylogeny generation Added reference to the project's preprint	`README.md`

Tips and commands

Interacting with Sourcery

Trigger a new review: Comment @sourcery-ai review on the pull request.
Continue discussions: Reply directly to Sourcery's review comments.
Generate a GitHub issue from a review comment: Ask Sourcery to create an
issue from a review comment by replying to it.
Generate a pull request title: Write @sourcery-ai anywhere in the pull
request title to generate a title at any time.
Generate a pull request summary: Write @sourcery-ai summary anywhere in
the pull request body to generate a PR summary at any time. You can also use
this command to specify where the summary should be inserted.

Customizing Your Experience

Access your dashboard to:

Enable or disable review features such as the Sourcery-generated pull request
summary, the reviewer's guide, and others.
Change the review language.
Add, remove or edit custom review instructions.
Adjust other review settings.

Getting Help

Contact our support team for questions or feedback.
Visit our documentation for detailed guides and information.
Keep in touch with the Sourcery team by following us on X/Twitter, LinkedIn or GitHub.

sourcery-ai

Hey @GavinHuttley - I've reviewed your changes and they look great!

Here's what I looked at during the review

🟢 General issues: all looks good
🟢 Security: all looks good
🟢 Testing: all looks good
🟢 Complexity: all looks good
🟡 Documentation: 1 issue found

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

sourcery-ai · 2024-11-11T20:47:04Z

README.md

-`diverse_seq` provides tools for selecting a representative subset of sequences from a larger collection. It is an alignment-free method which scales linearly with the number of sequences. It identifies the subset of sequences that maximize diversity as measured using Jensen-Shannon divergence. `diverse_seq` provides a command-line tool (`dvs`) and plugins to the Cogent3 app system (prefixed by `dvs_`) allowing users to embed code in their own scripts. The command-line tools can be run in parallel.
+`diverse-seq` implements computationally efficient alignment-free algorithms that enable efficient prototyping for phylogenetic workflows. It can accelerate parameter selection searches for sequence alignment and phylogeny estimation by identifying a subset of sequences that are representative of the diversity in a collection. We show that selecting representative sequences with an entropy measure of *k*-mer frequencies correspond well to sampling via conventional genetic distances. The computational performance is linear with respect to the number of sequences and can be run in parallel. Applied to a collection of 10.5k whole microbial genomes on a laptop took ~8 minutes to prepare the data and 4 minutes to select 100 representatives. `diverse-seq` can further boost the performance of phylogenetic estimation by providing a seed phylogeny that can be further refined by a more sophisticated algorithm. For ~1k whole microbial genomes on a laptop, it takes ~1.8 minutes to estimate a bifurcating tree from mash distances.
+
+You can read more about the methods implemented in `diverse_seq` in the preprint [here](https://biorxiv.org/cgi/content/short/2024.11.10.622877v1).


issue (documentation): Package name is inconsistently written as both diverse-seq and diverse_seq

Please standardize the package name throughout the documentation to avoid confusion.

coveralls · 2024-11-11T20:58:03Z

Pull Request Test Coverage Report for Build 11785928718

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

For more information on this, see Tracking coverage changes with pull request builds.
To avoid this issue with future PRs, see these Recommended CI Configurations.
For a quick fix, rebase this PR at GitHub. Your next report should be accurate.

Details

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 91.892%

Totals
Change from base Build 11770334092:	0.0%
Covered Lines:	1190
Relevant Lines:	1295

💛 - Coveralls

DOC: update intro and add link to preprint

7140b16

GavinHuttley merged commit ac92055 into HuttleyLab:main Nov 11, 2024
9 of 10 checks passed

sourcery-ai bot reviewed Nov 11, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: update intro and add link to preprint #80

DOC: update intro and add link to preprint #80

GavinHuttley commented Nov 11, 2024 •

edited by sourcery-ai bot

Loading

sourcery-ai bot commented Nov 11, 2024 •

edited

Loading

Interacting with Sourcery

Customizing Your Experience

Getting Help

sourcery-ai bot left a comment

sourcery-ai bot Nov 11, 2024

coveralls commented Nov 11, 2024

DOC: update intro and add link to preprint #80

DOC: update intro and add link to preprint #80

Conversation

GavinHuttley commented Nov 11, 2024 • edited by sourcery-ai bot Loading

Summary by Sourcery

sourcery-ai bot commented Nov 11, 2024 • edited Loading

Reviewer's Guide by Sourcery

File-Level Changes

Interacting with Sourcery

Customizing Your Experience

Getting Help

sourcery-ai bot left a comment

Choose a reason for hiding this comment

sourcery-ai bot Nov 11, 2024

Choose a reason for hiding this comment

coveralls commented Nov 11, 2024

Pull Request Test Coverage Report for Build 11785928718

Warning: This coverage report may be inaccurate.

Details

💛 - Coveralls

GavinHuttley commented Nov 11, 2024 •

edited by sourcery-ai bot

Loading

sourcery-ai bot commented Nov 11, 2024 •

edited

Loading